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Abstract 



We consider exchange of three intervals with permutation (3, 2, 1). The aim of this 
paper is to count the cardinality of the set 3iet{N) of all words of length N which 
' appear as factors in infinite words coding such transformations. We use the strong 

, relation of 3iet words and words coding exchange of two intervals, i.e., Sturmian words. 

^ ' The known asymptotic formula ^2iet{N)/N^ ~ -i^ for the number of Sturmian factors 

'5 ! allows us to find bounds + o(l) < #?,iei{N)]N^ < 7^ + 

P.- 

1 Introduction 

^ ' The study of infinite words arising from an exchange of several intervals was initiated by 

OO , Rauzy, whereas dynamical systems connected with exchange of intervals has been already 

\ studied by Katok and Stepin [13j. The most explored subfamily of such words are Sturmian 

fSJ ' words corresponding to an exchange of two intervals [2p|. Infinite words coding exchange 

. of 3 intervals form the next interesting family. 

\ The special case of Sturmian words is very well understood. For surveys on the prop- 

erties of Sturmian words see [I6t i20j . 

Much less is known about codings of r intervals for r > 2. Combinatorial character- 
ization of the language of such words was given first for r = 3 in [TU], then for general 
^ ' r in [5]. Boshernitzan and Caroll have found a sufficient condition for substitutivity 

■ of an infinite word coding exchange of r intervals. The necessity of this condition in the 

special case r = 3 and reverse-order permutation was demonstrated in [1]. A criterion for 
substitution invariance of words coding exchange of 3 intervals with permutation (3,2,1) 
is given in [2] and [3]. 

Actually, infinite words coding exchange of 3 intervals with permutation (3,2,1), here 
called 3iet words, are subject of this paper. They stand out among interval exchange words 
by the fact that, similarly to Sturmian words, they can be geometrically represented by 
cut-and-project sequences. This was first remarked in |14j . then developed in [11]. Even 
more narrow connection between 3iet words and Sturmian words is displayed in [2]. This 
connection allowed us to estimate the number of factors of 3iet words using the following 
enumeration formula for Sturmian words, first given by Lipatov [15]; see also |17t [6]. 
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Theorem 1. !115\ \17[ {Bjj The number ^2iet{N) of Sturmian factors of length N is given 
by 

^ 1 
#2iet(7V) = 1 + y (AT + 1 - k)ip{k) = + 0{N^ log N) , 

k=l 

The aim of this paper is to show the following theorem. 
Theorem 2. The asymptotic growth of the number of 3iet factors of length N is given by 

AriV^ + 0(N^+^) < #3iet(iV) < + 0(N^+^) , 

for arbitrary 6 > 0. 

The proof of Theorem [2] is divided into two parts, the upper bound is given in The- 
orem [9] in Section \5\ the lower bound in Theorem [12] in Section [H Lemmas on Sturmian 
words used in these sections may be of independent interest. Sections [3] and H] describing 
properties of the Euler function and of Sturmian factors are auxiliary. 

Precise values of #3iet(A^) for < 10 are found in Section [71 

2 Definitions of interval exchange words 

Let ^ be a finite alphabet. A concatenation w = wqWi ■ ■ ■ Wn-i of letters Wi £ A is called 
a word of length \w\ = n. The number of occurrences of a letter a in the word w is denoted 
by \w\a- The set of all finite words over A together with the empty word is denoted by 
A* . We can also consider one- or bidirectional infinite words u = uqUi • • • € A^ , resp. 
u = ■ ■ ■ u^2U-iUqUiU2 • • • € A^. A finite word it; is a factor of an infinite word u = (un) if 
w = MiUj+i ■ ■ ■ lij+n-i for some i. The factors of an infinite word u form the language of 
n, which is denoted by C{u). We denote Cn{u) = C{u) PI A"". The factor complexity of u 
is the function C„ : N i— > N given by Cu{n) = #Cn{u). 

Sturmian words are aperiodic words with the smallest possible complexity, namely 
Cu{n) = n + 1 for all n E N. It is known [18j that Sturmian words on {0, 1} are exactly the 
lower and upper mechanical words, which means that each Sturmian word u = {un)nez G 
{0, 1}^ is defined by 

Un = [{n + l)a + (3\ — [na + (3\ or 
Un = [{n + l)a + P] - [na + /?] , 

where a G (0, 1) is irrational and /? G [0, 1). Another representation of Sturmian words is 
given by the two interval exchange. Define e = 1—a € (0, 1), where a is the parameter used 
above. Consider the intervals Iq = [0, e), Ii = [e, 1), /q U/i = [0, 1), and the corresponding 
transformation : [0, 1) i-^ [0, 1) defined by 

Jx + l-e for xG/o, 
\x — e tor X G ii . 

We can code the orbit of a point xq G [0, 1) by the infinite word Ui^^xo = {un)nez £ {0, 1}^: 

fo ifT^(.,)E/„, 

\i ifr»(xo)eA. 

The word Ue^xo is the upper mechanical word corresponding to a = 1 — e and (3 = xq. 
Similarly, lower mechanical words are obtained by coding exchange of intervals of the type 
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(•,■]. It is the representation of Sturmian words by two interval exchange which will be 
the most used here. 

Since the language of a Sturmian word does not depend on the initial point xq, but 
rather only on the parameter e, we can denote the language of u^^xq by >C(e) and the set 
of factors of length M of Us^xo by >CM(e)- 

We study infinite words which generalize Sturmian words in the way that they code 
exchange of three intervals. Let e,i be real numbers satisfying 



e G (0, 1) \ Q , max{e, 1 - e} < £ < I . 
The mapping T^^i{x) : [0,£) ^ [0,^) defined by 



(3) 



X + 1 — e for X € /yi 
X + 1 — 2e for X € /_B 
X — e for X € /c 



[i-l + e,e), 



(4) 



is called exchange of three interval^ with permutation (3,2,1). The orbit of a point 
xo S [0, i) under the transformation T^^i of ^ can be coded by the infinite word u^^e^xo = 
{un)n& in the alphabet {A, i?,C}, where 



A ifT^/xo)elA, 
B ifT^/xo)elB, 

[c ifr,Vxo)G/c. 



(5) 



A visual representation of a 3 interval exchange transformation can be found, e. g., in pj. 

Since e is irrational, the infinite word u is aperiodic. We call these words 3iet words. 
Again, the language of a 3iet word does not depend on the initial point xq, thus we denote 
it by C{£,1). The factor complexity of 3iet words depends on the parameter i in the 
following way: C{n) = 2n + 1, if i Z + eZ, otherwise it satisfies C{n) = n + const for 
sufficiently large n, see [1]. Words with the latter complexity are called quasisturmian and 
described for example in [HI [9]. 



3 Asymptotic behaviour of the totient function 

In determining the asymptotic behaviour of the number of 3iet factors, we shall strongly 
use asymptotic properties of the Euler totient function, 

ip{n) = #{k en\k<n, k±n}=nYl " -) ' 

p prime 

where k -L n means gcd(A;,n) = 1. In our formulas, we will use the Landau big O 
notation; instead of a function /(n), we write 0{g{n)) if there exists a constant K such 
that |/(n)| < K\g{n)\ for ah n G N. 

The proof of the first asymptotic formula for the totient function can be found for 
example in |19j : 

n 2 

^(^(/c) = —n^ + C)(n log n). (6) 
k=l ^ 

^Usually one defines exchange of three intervals of lengths a,/3,7 > satisfying a + f} + -y — 1. Our 
choice of parameters ^ implies relation a + 2/3 + 7 = 1, where e — a + l3 and £ — a + (5 + "/. 



3 



Using the first equality from Theorem [T] and this formula, one can easily derive the fol- 
lowing estimate found already in [15]: 



^ 1 

#2iet(iV) = 1 + y (iV + 1 - k)^p{k) = + OiN^ log N) . (7) 



k=l 

Yet another formula involving the totient function will be useful. It can be easily 
derived that for any q, numbers coprime to q are in a certain sense uniformly distributed, 
namely 

#{pen\p ±q, p<i} = + (!)(2'"(^)) , 

q 

where u;{q) is the number of prime divisors of q. From the result of Hardy and Wright [12] 
on the asymptotic behavior of Lo{q), it follows that 

#{p G N j p _L j <p<i} = ^^(^ - j) + C(g^) , for any <5 > 0. (8) 

4 Sturmian factors and Farey numbers 

As we have already mentioned, for every fixed irrational e € (0, 1) and fixed M G N, the 
number of factors of length M in the language C{e) of a Sturmian word is exactly M + 1. 
Obviously, the set >CM(e) does not determine e uniquely: the same set of M + 1 factors 
appears for uncountably many irrational e which form an interval. As shown in [T7], if 
for some irrational ei < £2 it holds that CmI^i) 7^ J~-Mi£2), then there exists a rational | 
such that £1 < I < £2 and q < M. Therefore the interval (0, 1) of admissible parameters 
£ is partitioned into small intervals determining classes of Sturmian words with distinct 
Cm{£)- The partition is given by the Farey fractions of order M, i.e., all the reduced 
fractions in [0, 1] with denominator smaller than or equal to M. It is useful to consider 
the Farey fractions as an ordered list 

■ fo = J<fl<---< fr-l < fr = J- 

The partition [0,1] = ^l=Q[fi, fi+i], where J^m = {/« | i = 0, . . . , r}, is essential for 
describing Sturmian factors of length M. We shall thus speak about classes of Sturmian 
words of length M determined by intervals (/j, /j+i), i = 0, . . . , r — 1. Every class contains 
M + 1 factors and of course, the same factor may belong to different classes. For M = 4, 
the division is illustrated in Figure [1] together with the corresponding sets of factors of 
length 4. 

11 1 9. ^ 







4 3 




2 


3 4 




0111 


0110 


0101 


0010 


0001 


0000 


1011 


0111 


0110 


0100 


0010 


0001 


1101 


1011 


1010 


0101 


0100 


0010 


1110 


1101 


1011 


1001 


1000 


0100 


1111 


1110 


1101 


1010 


1001 


1000 



Figure 1: Classes of Sturmian words of length 4. 

The possible numbers of letters and 1 in factors corresponding to an interval (/j, fi+i] 
can be easily determined as follows. 
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Lemma 3. Let M £N be fixed, and let fi, /j+i be two consecutive Farey fractions of order 
M. Let £ be an irrational in {fi, fi+i). Then for every factor w of length M of a Sturmian 
word with parameter e we have 

\uj\o e{[Mfi\, [Mf,\ + 1} , \w\i G {M - [Mf,\,M- [Mfi\ - l} . 

Proof. Denote a = 1 — e. If w = wqWi ■ ■ -wm-i is a factor of a Sturmian word with 
parameter e, there must exist /3 € [0, 1) such that 

Wi = [{i + l)a + p\ - [ia + p\ e {0,1} , for z = 0, 1, . . . , M - 1 . 

Therefore 

A4-1 

\w\i = ^ = [Ma + /3J € { lMa\ , [Ma\ + l} . 
i=0 

The language Cui^) is the same for all e G (/,, /j+i) and we have \_Ma\ = |_M(1 — e)J = 
M - \Me\ = M - [Me\ -1 = M - [Mfi\ - 1. □ 

Now consider words coding the transformations Tf. defined as in ([1]). If /j = |, then 

such a word is purely periodic and looks as • • • = where w^'^ is a primitive 

word of length q. Recall that a finite word v is said to be primitive if it cannot be written 
in the form v = vow ■ ■ -w = for any integer k > 2. Therefore the language of the 
periodic word coding the transformation Tf. has exactly q factors of length M for every 

M > q. Let us denote the set of such factors by L^. As it is derived in [17] . 

CM{e) = Lg U , for ah e G {fi, . 

We have jj^Cui^) = M+1, #L^^j = qi, and #L^j^^^ = gj+i, where qi, gj+i are denominators 
of Farey fractions fi, fi+i, respectively. Therefore we have 

# (ig n =qi + qi+i - M - 1 . (9) 

5 The upper bound 

The main tool which we use in this section is a link between 3iet words and Sturmian 
words over the alphabet {0,1} by means of morphisms croi,aio : {A,B,C}* i-^ {0,1}* 
defined by 

fjoi : A^0,B^01,C^l, 

(10) 

aio: A^O, B^IO, C^l. 
In p], the following statement is proved. 

Theorem 4. A ternary word u is a 3iet word if and only if both o"oi('u) and aio{u) are 
Sturmian words. 

Based on this strong relation, we define the notion of 6-amicability. 

Definition 1. Let w^^^w^"^^ be finite words over the alphabet {0, 1}. We say that w^^\ 
^(2) form a 6-amicable pair, if there exists a 3iet factor w over {A,B,C} with exactly b 
letters B such that w^^'^ = aoi{w) and w^'^^ = aiQ{w). 
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Figure 2: Finite words w^^^ = 0100101 and w^'^^ = 0101001 forming a 1-amicable pair. 
The corresponding ternary word is w = AC ABAC. 



The notion is ihustrated on Figure [21 Note that due to Theorem (U both words of 
a 6- amicable pair are Sturmian factors from the same language C{e) for some irrational 
e G (0,1). A 3iet factor w of length with precisely b letters B corresponds to a b- 
amicable pair of Sturmian factors of length N + b. Obviously, different 6-amicable pairs 
give rise to different 3iet factors. Therefore we can express the number of 3iet words as 
the number of corresponding 6- amicable pairs. 

Proposition 5. Let iV G N. Then 

N N 
#3iet(A^) = ^ #3iet(A, 6) = ^ # b-amicable pairs of length {N + b) , 

b=0 b=0 

where 3iet(A, b) denotes the set of 3iet factors of length N with precisely b letters B. 

Let us focus on 5-amicable pairs of Sturmian factors. First, it is obvious that words 
which form a 6-amicable pair must both contain at least b letters and b letters 1. As a 
consequence of Lemma [31 we have the following statement. 

Corollary 6. Let 6, M G N and let e be an irrational in where are 

consecutive Farey fractions of order M. If Cm{£) contains a b-amicable pair, then 



6<min{LM/iJ + 1,M- LM/ij} 



Lemma 7. Let Z^^b be the family of classes of Sturmian words of length M containing 
at least one b-amicable pair. Then for M < 2b we have ^Z^i^i = 0, and otherwise, 

#Zm b < 4t(M - 2b)M + 0(M^+^) , for arbitrary 6>0. 

Proof. Obviously, words forming a 6-amicable pair must be of length at least M > 26. 
Consider an interval [/j, /i+i) such that the language Cm{£) for e £ {fi, fi+i) contains an 
amicable pair and fi = ^, p ^ q. Corollary [H then implies that b must satisfy 

b<M- + l and b < M - M- + 1 , 



providing bounds on p, namely, 

6-1 



<P< 



Thus, we are looking for the number 



M 

E 

q=l 



if[pen\p±q and q!^ < p < q{l - h^)^ 
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With the use of ([8]) we can write 

M 



9=1 

M - 26 + 2 ^ ^(g) + 0(Mi+^) for all 5 > 0. 



M 

9=1 



Together with the asymptotic behavior of X]g=i vC?) given in ([6]), this implies the state- 
ment of the lemma. □ 

Lemma 8. Let e e (0,1) be irrational. Let 6, M € N, 2& < M. Then the number of 
distinct b-amicable pairs in class Cm{£) is at most M — b. 

Proof. The language Cm{£) contains M + 1 factors of length M. Choose such a factor w 
and consider the set of points x G [0, 1) such that the orbit {Ti (x)}, j = 0, 1, . . . , M — 1, is 
coded by w. It can be easily shown that this set is a subinterval of [0, 1) of the type [■,•). 
Thus [0, 1) is divided into M + 1 disjoint subintervals; the division is done by M division 
points Te •'(e), j = 0, 1, . . . , M — 1. Let us denote these intervals Jj, i = 1, 2, . . . , M + 1, 
ordered so that x < y for any x ^ Ji and any y G Ji+i- The corresponding words of 
length M are denoted by w^"^ , i = 1, 2, . . . , M + 1. Note that T^{Ji) is an interval for all 
J = 0,1,...,M-L 

Let us first find an upper bound for the number of 1-amicable pairs of Sturmian factors 
of length M. The following procedure is illustrated on Figure El Take two consecutive 
intervals, Jj, Jj+i for some i = 1, . . . ,M. It means that Jj U Jj+i is an interval which 
contains in its interior exactly one of the division points, say T~^{e). If /c = 0, then the 
word w^*) starts with and the word starts with 1. More generally, realize that 

words w^"^^ and it;^*"^^^ have a common prefix v of length k and the set T^{Ji U Jj+i) is still 
an interval. Since T~^{e) G (Jj U Jj_|_i)°, we have e G T^[{Ji U Jj+i)"), and therefore the 
word w^*-* has prefix vO and the word u;^*"*"^) has prefix vl. 

If k = M — 1, then w^^^ = vO and = vl and they are not 1-amicable. If, by 

contrary, k < M — 1, then T^^^{Ji) = T^{Ji) + 1 — e is an interval of the form [•, 1), and 
r''+^(Jj+i) = r''(Jj+i) - e is an interval of the form [0, •). We further have 

r'=+2(j.) = r'=+i(j.) _ e = T\ji) + l-2e, 

T''^\J^+l) = T^^\J^+l) + 1 - e = T\J,+^) + 1 - 2e . 

and therefore the set T^{Ji U Ji+i) is again an interval for all j, A; + 2 < j < M — 1, and 
thus the words w^'^'^ and have a common suffix v' of length M — k — 2. Therefore 

we have w^^^ = vOlv' and w^'''^^^ = vWv' and the words w^^^ and w^^'^^^ form a 1-amicable 
pair. Together, we have a 1-amicable pair of words u)^*-* and tt;('+^) for i = 1, . . . , M, i ^ k. 
Their number is therefore M — 1. 

The previous considerations imply that the words i = 1, . . . , M + 1, are ordered 
lexicographically, where consecutive words differ only by interchange of 01 10 at one 
place. The only exception is the pair of words w^^^ and w^*"*"^-* whose intervals Jj, Jj+i 
are separated by the iteration T^^'^^^{e). These words are of the form w^^^ = vO and 
^(«+i) = yi^ which corresponds to 'half of the interchange 01 10. 

From that, it is clear that 6-amicable pairs can only be the pairs of words w^^\ w^''~^^\ 
i = 1, . . . , M + 1 — b, with exceptions, namely those indices i, such that the interval 
Jj U ■ ■ ■ U Jj+b contains in its interior the iteration T^^^^^{e). However, since we do not 
know the position of this point, it is difficult to determine the number of these exceptions, 
but it is at least one. Therefore the number of 6-amicable pairs is at most M — b. □ 
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Figure 3: Iterations of consecutive intervals Jj, Jj+i show that the corresponding words 
form a 1-amicable pair. 

Now we are ready to prove the upper bound from Theorem [2j 
Theorem 9. The number of 3iet factors of length N satisfies 

#3iet(iV) < + 0(N^+^) 

for arbitrary 6 > 0. 

Proof. Combining Proposition [5] and Lemma El we see that 

N N 

#3iet{N) = #3iet{N, 6) = ^ # 6-amicable pairs of length {N + b) < 

6=0 6=0 

N 

< # 6-amicable pairs of length (A^ + b) in the class /C < 

6=0 /C6Z]v+6,6 

N 

<Y,NifZN+b,b. 

6=0 

Putting in the estimate on #^Ar+;,,6 from Lemma [71 we obtain 

N N 

#3iet(iV) <NY—{N-b){N + b) + NY,0{iN + b)'^+^) = 

6=0 ^ 6=0 

= i.N( N^(N + 1) - -N(N + l)(2N + I)) + 0(iV3+'^) = 

TT^ V 6 / 

6 The lower bound 

Consider the following geometric representation of Sturmian and 3iet words by cut-and- 
project sequences. Given a strip in the Euclidean plane parallel to the (irrationally ori- 
ented) straight line y = ex, take all points of the lattice and project them orthogonally 
to the straight line. It is known [141 [TT] that in such a way, one obtains a sequence of 
points with two or three distances between them. If the distances are two, say Ai, A2, we 
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can code them by two letters, and the resulting word is a Sturmian word. If the distances 
are three, they are of the form Ai, A2, Ai + A2. Coding these distances by three letters, 
we obtain a 3iet word. Moreover, Sturmian word is obtained only for a discrete set of 
values of the width of the considered strip. For details about this construction we refer 

to [n]. 

On the other hand, any Sturmian word xn or a 3iet word Ugr £ xn can be represented 



in this way, the parameter e being the slope of the straight line and 
the width of the strip. If without loss of generality we take e € (0, 1) 
strips giving rise to Sturmian words form a sequence 

• • • < max{e, 1— e}<l<l + e<--- 

All parameters i in between these values yield 3iet words. 



! corresponding to 
the widths of the 




Figure 4: Geometric representation of Sturmian and 3iet words by cut-and-project se- 
quences. 

This geometric representation is illustrated in Figure HI where £1 < £2 are two con- 
secutive discrete values for which the cut-and-project scheme yields a Sturmian word. 
Projections of points in the strip Si with parameter £1 correspond to a Sturmian word 



u 



(1) 



over the alphabet {A,B}, where A stands for the projection of the horizontal side 
and B of the diagonal of a unit square. The strip 5*2 corresponding to the value £2 gives 
a Sturmian word n*^^^ over {A, C}, where again, A is the projection of the horizontal side 
and C of the vertical side of the unit square. Note that the projection of a diagonal is the 
sum of projections of the vertical and horizontal sides. This corresponds to the fact that 
the infinite word u^"^^ arises from u^^^ replacing every B by AC. 

Any strip S such that Si C S" C ^2 gives rise to a 3iet word over the alphabet {A, B,C} 
where only some letters B have been replaced by AC. Enlarging the strip S from Si to 
5*2 causes more and more lattice points to fall into S and thus to split more and more 
distances B into CA. This procedure is illustrated in Figure [5] on a finite segment taken 
from Figure m 

Let now w^^^ G {A,B}* be a Sturmian factor of length M with at least b letters B, 
b < M. By a suitable choice of the enlarged strip S, we obtain from w^^^ a 3iet factor w of 
length M + b containing exactly b letters C . Such a word w satisfies > \w\c, where 
the equality holds only \iw = [CA)^'^. Since the role of letters A and C is symmetric over 
the set of all 3iet factors, the interchange A ^ C \n the factor w gives again a 3iet factor. 
Denoting by the number of Sturmian factors of length M with at least b letters B, 
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ABAABABA ABAACAABA ACAAACAABA ACAAACAACAA 



Figure 5: Construction of 3iet factors with given number of letters C by substituting AC 
for suitable letters S in a Sturmian factor. 

we can summarize the above considerations by saying that #3iet(A^) > 2^'^^^ HN^h,b- 
Moreover, realize that a Sturmian factor with at least b letters B is of length at least b, so 
the sum is taken only over b such that N — b > b, i.e., b < \_N/2\. This gives the following 

Proposition 10. Let N eN. Then 

L7V/2J 

#3iet(A^) > 2 ^ Sturmian factors of length {N — h) with at least b letters B . 
b=o 

To estimate the number in the right part of the equation, we use the relation 1^. Let 
us mention that an asymptotic formula for the number of Sturmian factors w with given 
\w\ and was derived in [4J. 

Lemma 11. The number HM,b of Sturmian factors of length M with at least b letters 1 
satisfies 



1 



HM,b >^{M- b)M^ + 0{M'^+^) , for any 6 > . 



Proof. Let fi be the i-th Farey fraction fi = Consider the set L^^]. By Lemma El 

(i) 

if 6 < M — Mfi — 1, then every factor in L\j has at least b letters 1. The inequality 
M — Mfi — 1 >b can be rewritten as 



M -b-1 , , 

Vi < Qi ■ (11) 

By ([8]), the number of Farey fractions with the same denominator q = qi < M satisfy- 
ing PT]) is equal to 

+ Oiq') , for any <5 > . (12) 

Since the values of Farey fractions increase with i, there exists a maximal index, say 
s, such that every factor in the set IJi=o -^iv/ sufficient number of letters 1. Therefore 



HM,b > # U ■ 



In order to determine the cardinality of this union, we write it as a disjoint union, namely 



U^2 = ^!!? u u(^s\^r^ 

1=0 i=l 

Since i^Lj^j = qi and ij^L\i = and equality ([9]) holds, we obtain 



if[Ll>\Llj'>)=M + l~q,.^ 



10 



which imphes 

s s 
HM,b > 1 + + 1 - qi_i) = ^(M + l-qi)-M + qs. 

i=l i=0 

We group summands with the same values of qi = q. The number of such summands (for 
each q) is given by ()12p . Therefore 

M M -h-^ 

HM,b > ^{M + 1 - q) ^{q) + 0{M'-^') = 

9=1 

= -T7 2 + 0{M'+') , 

M vr^ 

where we have used ([7]). The result easily follows. □ 

The following theorem provides slightly better lower bound on #3iet(A^) than that 
announced in Theorem [21 Instead of the constant | we give > |- 

Theorem 12. The number of 3iet factors of length N satisfies 



#3iet(iV)>^iV4 + 0(iV3+5), 

for arbitrary 5 > 0. 

Proof. Combining the formula from Proposition [10] and Lemma [TTl we have 

lN/2\ lN/2\ lN/2\ 

#3iet(A^) > 2 HN-b,b >^Y{N-2b){N- bf + 0{{N - bf+^) 

6=0 6=0 6=0 

2 VN/2\ 



7r2 



6=0 



TT^\ 2 2 3 4/ ^^ 

'^^'^1-1 + ^-^)+ 0{N' log N) = ^N' + 0{N^^') . □ 



7r2 V2 2 24 32/ ' ^ ' 48tt'^' 



7 Precise values for small lengths 

We have seen for Sturmian words, that given € N, the interval (0, 1) of possible values 
of the parameter e was divided into X^^Li ^{f^) areas such that all Sturmian words with 
parameter within one area had the same set of A^+1 factors of length A^. Similarly, one can 
expect that the family of possible pairs of parameters e G (0, 1) and i € (max{e, 1 — s}, l) 
will be divided into regions with the same set of factors of length N. The number of factors 
of a 3iet word with parameters in the interior of the regions will be equal to 2N + 1. 

Let us describe the way to obtain the list of factors of length N for fixed parameters 
e, i. Denote di = e, d2 = i — l + £, the discontinuity points of the transformation T^^. The 
domain of T^/, namely [0, £), is divided by points T~^{di), T~^{d2), j = 0,1, . . . , N — 1 
generically into 2N + 1 subintervals. (This happens for i ^ Z + Ze and for small N also 
for other i. Otherwise, some of these iterations coincide and the number of subintervals 
is smaller.) Each of these subintervals corresponds to one factor of length N occurring in 
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the language of the infinite word u^/,xo ^^t any xq. The ordering of iterations T^^{di), 

T^l{d2) in [0,^) depends on parameters e,£; different orderings give rise to different fists 
of 3iet factors. Values of e,£ providing the same ordering are given by finear inequafities 
in e, i. Figure sfiows tfie division of the region of parameters e, I by these inequafities 
for factors of length 2. The set of all 3iet factors of length 2 is the union of lists given in 
the figure, i.e., 

3iet(2) = {AA, AB, AC, BA, BB, BC, CA, CB, CC} , and #3iet(2) = 9 . 




AC, BC, CA, CB, CC 
AC, BB, BC, CA, CB 
AB,AC, BA,BB,CA 
AA,AB,AC, BA,AC 



Figure 6: Division of the region of parameters e, I by inequalities for factors of lengtfi 2 
and tfie corresponding lists of factors. 

In Figure [71 we give tfie same analysis for factors of length = 3. Note that the 
region and its division into areas according to factors of given length must be symmetric 
with respect to the axis e = 1/2. This corresponds to interchange of letters A^ C all 
the factors. Figure [7] shows only one half of the picture. 




Figure 7: Division of the region of parameters e, £ by inequalities for factors of length 3. 



12 



The lists of factors of length 3 in individual areas of Figure [7] are given as follows. 

Qi : AAC, ABA, AC A, BAC, CAA, CAB, CAC 

^2 ■■ ABA, ABB, AC A, BAC, BBA, CAB, CAC 

: ABB, AC A, BAC, BBA, BBB, CAB, CAC 

: ABB, AC A, BAB, BAC, BBA, BBB, CAB 

: ABA, ABB, AC A, BAB, BAC, BBA, CAB 

: AAC, ABA, AC A, BAB, BAC, CAA, CAB 

: AAB, AAC, ABA, AC A, BAA, BAB, CAA 

Qs ■■ AAA, AAB, AAC, ABA, AC A, BAA, CAA 

The lists of factors in areas Og, . . . , fiig not shown in Figure [7| are obtained from ili, . . . , rig 
by the interchange A ^ C . The set of all 3iet factors of length 3 is therefore equal to 

3iet(3) = {A, B, Cf \ {ABC, CBA} 

and hence #3iet(3) = 25. 

A computer evaluation provides the following table of values #3iet(A^) for A'^ = 
1,...,10. 



N 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


#3iet(iV) 


3 


9 


25 


55 


113 


199 


339 


531 


809 


1165 


7r2#3iet(iV) _ 

iV4 


29.6 


5.55 


3.05 


2.12 


1.78 


1.52 


1.39 


1.28 


1.22 


1.15 



As we have proved above, asymptotically, the values from the last string of the table 
should fall between 17/48 and 2. It seems from the table that these bounds start to hold 
quite early, and perhaps it is possible to improve them. 
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